When Do Birds of a Feather Flock Together? K-Means, Proximity, and Conic Programming

نویسندگان

  • Xiaodong Li
  • Yang Li
  • Shuyang Ling
  • Thomas Strohmer
  • Ke Wei
چکیده

Given a set of data, one central goal is to group them into clusters based on some notion of similarity between the individual objects. One of the most popular and widely-used approaches is K-means despite the computational hardness to find its global minimum. We study and compare the properties of different convex relaxations by relating them to corresponding proximity conditions, an idea originally introduced by Kumar and Kannan. Using conic duality theory, we present an improved proximity condition under which the Peng-Wei relaxation of K-means recovers the underlying clusters exactly. Our proximity condition improves upon Kumar and Kannan, and is comparable to that of Awashti and Sheffet where proximity conditions are established for projective K-means. In addition, we provide a necessary proximity condition for the exactness of the Peng-Wei relaxation. For the special case of equal cluster sizes, we establish a different and completely localized proximity condition under which the Amini-Levina relaxation yields exact clustering, thereby having addressed an open problem by Awasthi and Sheffet in the balanced case. Our framework is not only deterministic and model-free but also comes with a clear geometric meaning which allows for further analysis and generalization. Moreover, it can be conveniently applied to analyzing various data generative models such as the stochastic ball models and Gaussian mixture models. With this method, we improve the current minimum separation bound for the stochastic ball models and achieve the state-of-the-art results of learning Gaussian mixture models.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Does "birds of a feather flock together" matter - Evidence from a longitudinal study on US-China scientific collaboration

China’s status as a scientific power, particularly in the emerging area of nanotechnology, has become widely accepted in the global scientific community. The role of knowledge spillover in China’s nanotechnology development is generally assumed, albeit without much convincing evidence. Very little has been investigated on the different mechanisms of knowledge spillover. Utilizing both cross-sec...

متن کامل

Gender and Personality in Media Rich Interfaces : Do Birds of a Feather Flock Together ?

This research explores how user and interface characteristics can interact to influence decision performance. Specifically, this research examines the effects of gender, personality similarity, and increased levels of information cues on user involvement with a computer-based decision aid. In addition, this research explores the downstream effects of user involvement on decision time, effort, s...

متن کامل

Do birds of a feather universally flock together? Cultural variation in the similarity-attraction effectajsp_

Three experiments explored the similarity-attraction effect (SAE) among North American and Japanese samples. In all studies, North Americans showed a significantly more pronounced SAE than the Japanese. The North Americans consistently revealed a strong SAE whereas the Japanese effect was only significant in the methods with the most power. The cultural differences emerged across different meth...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017